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Abstract 

Snoring is extremely common in the general population and when irregu- 
lar may indicate the presence of obstructive sleep apnea. We analyze the 
overnight sequence of wave packets — the snore sound — recorded during 
full polysomnography in patients referred to the sleep laboratory due to sus- 
pected obstructive sleep apnea. We hypothesize that irregular snore, with 
duration in the range between 10 and 100 seconds, correlates with respiratory 
obstructive events. We find that the number of irregular snores — easily ac- 
cessible, and quantified by what we call the snore time interval index (STII) 
- is in good agreement with the well-known apnea-hypopnea index, which 
expresses the severity of obstructive sleep apnea and is extracted only from 
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polysomnography. In addition, the Hurst analysis of the snore sound itself, 
which calculates the fluctuations in the signal as a function of time interval, 
is used to build a classifier that is able to distinguish between patients with 
no or mild apnea and patients with moderate or severe apnea. 

Keywords: snore, Hurst, time interval, OSA 
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1. Introduction 

Temporal patterns of natural events can be used to detect hidden dy- 
namics [lj. In fact, this connection dates back to Hurst's work j2] on the 
long-term memory (i.e. the autocorrelation) of a time series. This approach 
has been used to analyze respiratory sound events, and for instance the time 
interval between consecutive crackling sounds, generated when a collapsed 
airway opens, has been related to both lung structure and surface properties 
of the lung coating fluids [3j H] . The dynamics of various other physiological 
signals has been investigated with the help of tools derived from information 
theory and statistical mechanics (see e.g. [HJ [B] and references therein). 

Snoring is perhaps the best known respiratory sound, and is characterized 
by a loud sound with frequency content between 20 and 300 Hz. It is caused 
by the vibration of soft tissues from the upper airway, including soft palate, 
uvula, tonsils, tonsillar pillars, base of tongue, lateral pharyngeal walls and 
mucous membranes J7J. Snoring occurs during sleep, when the pharyngeal 
muscles relax and may block the airway. Snoring is common in the general 
population, and does not necessarily indicate a disease. In contrast, irregular 
snoring is one of the hallmark signs suggestive of Obstructive Sleep Apnea 
(OSA). OSA is the most common sleep-disordered breathing, and is charac- 
terized by repetitive events of upper airway obstruction during sleep, causing 
total or partial cessation of airflow (apneas and hypopneas, respectively) [5] . 

A respiratory event is defined as apnea or hypopnea if its duration exceeds 
10 seconds. Apnea is characterized by cessation of airflow, while hypopnea 
is defined as a substantial reduction - of at least 50% - in airflow. OSA 
is a major public-health problem due to the high prevalence in the adult 
population (ranging from 4% to 10%), and due to the poor outcome when 
not recognized and treated. OSA is associated with recurrent asphyxia, frag- 
mented sleep and generation of negative intrathoracic pressure during futile 
efforts to breath (9]. Although the patient is by and large unaware of these 
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recurrent respiratory episodes during sleep, the consequences are multiple 
and may include excessive daytime sleepiness, fatigue, poor cognitive func- 
tion and quality of life, as well as increased risk of motor vehicle accidents, 
metabolic and cardiovascular diseases. Full polysomnography is the gold 
standard method for OSA diagnosis, and the most important index express- 
ing the severity of OSA is the apnea-hypopnea index (AHI), which is the 
average number of apnea or hypopnea events per hour. Sleep apnea is usu- 
ally classified into mild, moderate or severe, depending on whether the AHI 
lies between 5 and 15 (mild), 15 and 30 (moderate), or above 30 events per 
hours (severe) [TU]. The major limitation of polysomnography is that it is an 
expensive and laborious test, which requires an overnight sleep in a special- 
ized laboratory under the supervision of a sleep technician. Therefore, the 
development of alternative and simpler methods for the diagnosis of OSA is 
a priority [11] . 

Irregular snoring and witnessed apneas are the major signals indicating 
OSA and there is an increasing amount of research trying to relate OSA with 
snoring patterns. Most methods in the literature try to establish a correlation 
between OSA and either the number of snore events or the wave features of 
the snore such as intensity, sound frequency, number of snore events, spectral 
density, or a combination of these features [121 1131 E] • Recently, Cavusoglu 
et al. [IHl HE] proposed the use of sequential properties of snoring episodes 
for OSA identification. They built sequences of snoring episode duration, 
snoring episode time separations and average snoring episode power, and 
were able to show that, using statistical properties of these sequences to 
measure snore regularity, OSA patients exhibit a greater degree of irregularity 
as compared with simple snorers. Following a related path, in this work 
we focus on the dynamics of irregular snoring episodes, and define a new 
Index, called the Snore Time Interval Index (STII), showing a strong positive 
correlation with the well-known apnea-hypopnea index. Furthermore, by a 
fluctuation analysis of the snore sound, through Hurst's R/S analysis, we 
build an automated classifier which is able to distinguish between patients 
with no or mild OSA and patients with moderate or severe OSA. In a similar 
spirit, a number of studies focused on the study of dynamic properties of sleep 
electroencephalogram (EEG) signals and their relation to OSA [T71 [181 US]- 
Here we work directly with acoustic signals, whose acquisition, in contrast 
with EEG signals, requires very simple equipment and no direct contact with 
the patients. 

The remaining of this paper is organized as follows. In section [2] we 
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Figure 1: Sound intensity versus time, showing the time intervals between consecutive 
detected snore events. 



describe details regarding data acquisition and processing, and introduce 
the mathematical definitions used in this work. In section [3] we present the 
results, and the paper ends in section [4| in which we provide a discussion 
and our conclusions. 

2. Methods 

2.1. Experimental and pre-processing details 

We analyzed snore records from patients referred to the sleep laboratory 
due to suspected OSA. The patients slept in a bed at the Sleep Laboratory of 
the Heart Institute at Clinics Hospital in Sao Paulo, Brazil, and we recorded 
the sounds in the room while the patients were simultaneously evaluated by 
a full polysomnography exam. 

The population of 17 patients for which sound and polysomnography 
data are available was characterized by a male/female ratio of 11/6, average 
age 50±10 years, average body mass index 30.7±6.7kg/m 2 , average apnea- 
hypopnea index 25.5±22.5events/h. Out of the 17 patients in the present 
study, 4 suffered from mild OSA, 6 from moderate OSA, and 7 from severe 
OSA. 
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The sound-recording system is based on a 16-bit audio codec (PCM 2901, 
Texas Instruments, USA) with a 4th order low-pass Butterworth analog filter 
(cutoff frequency of 5 kHz) and a 4th order high-pass Butterworth analog 
filter (cutoff frequency of 20 Hz). The audio codec is responsible for the 
acquisition (sampling frequency of 44.1kHz) of the sound captured by the 
microphones, and the transmission of the recorded data to a laptop computer 
by a USB serial input. The sound was captured by two microphones located 
at the headboard of the bed, at a distance of 90 cm from each other. 

The microphones acquired the sound S(t) emitted by the patient plus the 
ambient noise, with a total time T, and recorded the data into a microcom- 
puter for later analysis. Since the frequency content of the snore signal is 
predominantly between 20 and 300 Hz, the first step of the analyses was to 
digitally filter the signal, reducing the noise ratio. We found that the most 
suitable filter for dismissing the unwanted ambient noise while still keeping 
a strong snore signal was a 1024th-order band pass filter between 80 and 
300 Hz. 

Following the filtering, we square the filtered signal Sf(t) and sum over 
all [Sf(t)] 2 in a time window At, with a window overlap of 50% between 
consecutive windows, building a new series of sound intensities I m , 

(f +l)At 

Im= E P/(*)] 2 , 
t=f At 

with m going from zero to 2(T / At — 1), i.e., the series {I m } has a total of 
2(T/ At) — 1 elements. We choose At = 1 s, since the normal breathing period 
is approximately 5 seconds, and we found that this discretization gives us a 
good overall precision for our study while keeping the signal less rough. We 
notice that the new series still exhibits considerable background noise, most 
of it coming from air-conditioning and electronic devices used by the Sleep 
Laboratory. From a region in the series without snore, we could determine 
a threshold I below which all sounds are regarded as ambient noise, and 
then postulate that a group of points in our new series defines a single snore 
event when all I m in the group lie above the threshold Jo- According to this 
definition, the jth snore event starts at point m(j) and ends at the first point 
after m(j) for which the corresponding intensity is less than I . Thus, in time 
units the jth snore event will start at (1/2) At m(j). We then define the time 
interval between consecutive snores as the time from the beginning of one 
snore event to the beginning of the next one, 5tj = (1/2) At (m(j)—m(j — l)), 
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as illustrated in figure[IJ 

At the moment, it is necessary to adjust the value of Iq for each patient, 
considering the intensities of the snore and the ambient noise. We built a 
software which is able to read the sound data and process the signal, giving 
us a log file containing the recording time in hours, minutes and seconds, the 
average, median and standard deviation of the Stj, as well as the maximum 
and minimum values of Stj. 

As breathing, snoring is an irregular phenomenon and it is an indicative 
of instability in the upper airways. Snore usually persists at each breathing 
cycle until one of three situations occurs: (i) the obstruction momentarily 
disappears; (ii) there is a partial collapse of the upper airways, restricting the 
airflow and leading to hypopnea; (iii) the upper airway collapses, impairing 
the airflow and leading to an apnea event. We assume that in case (i) this 
momentary situation will persist for more than 100 seconds, and in both cases 
(ii) and (iii), after the patient restores normal breathing, (s)he will either get 
back to snoring or make a noise which will cross the ambient noise threshold 
h- 

We then hypothesize that in a person that snores but does not have sleep 
apnea disorder, most snore time intervals St will be less than 10 s or much 
larger than 100 s. On the other hand, for a patient with OSA we should find 
a lot of snore time intervals between 10 s and 100 s, which in fact we found. 
Thus, in our analyses we focus on the range of time intervals St between 10 s 
and 100 s. 

2.2. The Snore Time Inverval Index 

The Apnea-Hypopnea Index (AHI) is the classical index to determine if a 
person has sleep apnea, and is only fully characterized when a person spends 
a night of sleep in a sleep laboratory wired with electrodes and using an 
air mask to measure air flow. Along the night a specialist medical doctor 
goes through the data and manually counts the number of apnea and hypop- 
nea events Nah occurring during the whole night. The AHI index is then 
evaluated as 

AHI = Nah/T, (2) 

where T is the number of sleep hours. 

From the time intervals St, defined in the previous subsection, we intro- 
duce what we call the Snore Time Interval Index (STII), which is related to 
the number N§ t of snore time intervals for which 10 s < St < 100 s, being 
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explicitly calculated as 

STII = Ngt/T. (3) 

As we will show in section [3j there is a strong positive correlation between 
AHI and STII. 

2.3. A classifier based on Hurst's R/S analysis 

Along the lines of [20], we also built an automated classifier from a fluc- 
tuation analysis of the intensities I m , based on Hurst's R/S method (see e.g. 
|21j). which was introduced as a means of measuring memory effects in a 
time series. The R/S method involves dividing a time series into intervals 
of size r, within which a rescaled range is calculated as follows. For the kth 
interval, which we denote by L^, we first determine the average of the time 
series, 

Z T ,k = ~ ^2 I m , (4) 

which is used to define a cumulative deviation as 

m 

Zm,k = X! (Ij ~ Z r,k), (5) 

£k labeling the left end of L^. From this cumulative deviation, we then extract 
a range, 

Rr tk = max Z m)k - min Z m k , (6) 
which is divided by the standard deviation of the original series, 

St, 



\ 



-E(V ^k)\ (7) 



to yield the rescaled range R r ,k/S Tt k of a single interval. Repeating this 
procedure for all intervals we obtain the average rescaled range 

where n T denotes the number of nonoverlapping intervals of size r that can 
be fit into the original series. 

When applied to a time series generated by a single dynamics, such as 
in fractional Brownian motion, the rescaled-range analysis yields a function 
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p(r) following a power-law with a so-called Hurst exponent H, which gauges 
memory effects on the dynamics. Different values of H indicate different 
underlying dynamics, and this has been used in a number of situations to 
distinguish between the systems producing the time series. Especifically, this 
approach can differentiate between healthy and diseased subjects as regards 
cardiac [22J, neurological [231121], and respiratory function (25] ■ However, as 
shown in the next Section, this characterization in terms of a single parameter 
(the Hurst exponent) fails in the case of OSA, but the p{r) curves still hide 
useful information about the patients conditions. In order to extract that 
information, we resort to pattern-classification tools. 

Given a set {tj} = {ti,t 2 , . . . ,r d } of choices for the interval size r, the 
corresponding set p(r) measures the average relative fluctuation of the time 
series as a function of r, and this can be seen as a vector in a multidimensional 
space, 

x= (p(ti),p(t 2 ) ,...,p(T d )f , 

in which t indicates the vector transpose. The set of all vectors obtained by 
the R/S analysis can then be fed into standard algorithms from the pattern 
classification literature. Here, we employ a variant of the Karhunen-Loeve 
(KL) transformation (see [26J), which can be seen as an improvement over 
principal-component analysis, implementing supervision and relying on the 
compression of discriminatory information contained in the class means. 

In order to build a classifier whose performance is statistically significant, 
we split the intensity signals from each patient into subsignals containing 
1024 points (corresponding to 512 seconds, an amount of time during which 
many snore events can happen), obtaining a total of 936 subsignals for all 
patients. The R/S analysis was performed for values of the time window r 
corresponding to the nearest integers obtained from the powers of 2 1//4 from 
4 to 1024 points (with a total of 33 values). 

The first step of the KL transformation involves projecting the feature 
vectors Xj (i labelling the vectors obtained from different sound samples) 
onto the eigenvectors of the within-class covariance matrix S\y, defined by 

I n c N k 

s w = ^r^J2yik(^i- m k)(^i-m k ) T , (9) 
iV k=\ i=i 

where Nq is the number of different classes, N k is the number of vectors in 
class k, and is the average vector of class k. The element is equal to 
one if Xj belongs to class k, and zero otherwise. The resulting vectors are 
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then rescaled by a diagonal matrix built from the eigenvalues Xj of S^. In 
matrix notation, this operation can be written as 

X' = A^U T X, (10) 

in which X is the matrix whose columns are the training vectors Xj, A = 
diag(Ai, A2, •••)) an d U is the matrix whose columns are the eigenvectors of 
Sw Finally, the resulting vectors are projected onto the eigenvectors of the 
between-class covariance matrix S#, 

Nc N k 

S ^ = H ^( m fc - m )( m fc - m) T , (11) 

k=l 

where m is the overall average vector. The full transformation can be written 

as 

X" = V T A-^U T X, (12) 

V being the matrix whose columns are the eigenvectors of Sb (calculated 
from X'). 



3. Results 

As shown in figure[2j there is a remarkable agreement between the his- 
togram of the duration of apnea or hypopnea events and the histogram of 
snore time intervals. Moreover, our results reveal a strong positive correla- 
tion between the apnea-hypopnea index and the snore-time-interval index, 
as depicted in figure[3j Notice that, assuming a perfect correlation between 
AHI and STII when it comes to classifying OSA into no/mild versus mod- 
erate/severe, the STII yields only one false negative and two false positive 
results (false positives being usually more tolerable mistakes). 

Figure [i] shows examples of curves obtained by Hurst's R/S analysis of 
sound samples taken from four different patients. Estimates of the Hurst 
exponents of the different curves for all patients, obtained by linear fits of 
the log-log curves, yielded results ranging between 0.7 and 0.8, with no sig- 
nificance correlation to the patients condition. As described in the previous 
section, the Hurst curves, interpreted as feature vectors, were then fed into 
the automated classifier based on the KL transformation. The classifier was 
trained to recognize feature vectors of sound samples coming from two cate- 
gories: (a) patients with no or mild OSA; (b) patients suffering from moderate 
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Figure 2: The histogram of time intervals At between consecutive snores (square) and 
the histogram of apnea-hypopneia duration (circles) in seconds. We compare two extreme 
cases, filled and open symbols respectively for moderate and severe OSA. 



or severe OSA. The training was performed by randomly selecting 748 out of 
the available 936 subsignals (about 80 percent of the total), and using the re- 
maining signals for testing. During the training step, the mean transformed 
vectors of each class are extracted, and the (transformed) testing vectors are 
assigned to the class corresponding to the nearest mean transformed vector. 
With only two possible classes, the transformation matrix effectively reduces 
to a single line, i.e. a transformation vector [j] The inset in figure [4] shows, 
for a typical transformation vector, the (unnormalized) weights of the com- 
ponents corresponding to different sizes of the time window. The peaks in 
the curve indicate those components which are most relevant for the decision 
process. Notice the existence of large peaks around r = 20 (logr ~ 1.3), cor- 
responding to a time interval of 10, s, a time interval characteristic of many 
snore events in patients suffering from OSA. 

This procedure was repeated for 100 distinct choices of training and test- 



: This is a consequence of the fact that the subspace defined by the relative positions 



of the average vectors of the two classes, entering the definition of Sb in (11), is one- 
dimensional. 
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Apnea Hypopneia Index (AHI) 



Figure 3: Snore Time Interval Index versus Apnea Hypopneia Index (AHI), the latter 
defined by the polysomnography for 17 patients. The straight line is a line with slope 1 
to highlight the correlation between the two indexes (correlation coefficient of 0.841, with 
significance t = 6.02, indicating a probability of less than 0.1% for the null hypothesis 
of uncorrelated data). As defined by the AHI value, the dark grey region corresponds to 
severe OSA, the light grey region to moderate OSA, the lightest grey region to mild OSA, 
and the white region to normality. The dashed lines delimit the boundaries of the regions 
where the STII leads to false positive (upper left rectangle) or false negative (lower right 
rectangle) results, according to the classification in no/mild versus moderate/severe OSA. 
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Figure 4: Main plot: examples of curves obtained by Hurst's R/S analysis of the snore 
sounds, corresponding to the average relative fluctuation of the signal, p, versus the size of 
the time window, t, in double-logarithmic scale. Sound samples from 4 different patients 
were used, two with no or mild apnea, and two with moderate or severe apnea. Inset: 
Weights of the components of the Karhunen-Loeve transformation vector corresponding 
to the different sizes of the time window. Projection of the fluctuation vector of a given 
sound sample onto the Karhunen-Loeve vector determines the class to which the sample 
is attributed. 
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ing vectors. Based on each subsignal individually, the average classification 
error during the testing step was 37% for category (a) and 28% for category 
(b). However, if one extracts a classification from a majority rule of the cor- 
responding results for all subsignals belonging to a given patient, 16 of the 17 
patients are correctly assigned to their respective categories. Interestingly, 
the misclassified patient has an AHI of 15.2, and thus lies on the borderline 
between categories. 

4. Conclusions 

To summarize, in the present study we propose a simple and accurate 
method to identify OSA based on the time intervals between snore events, 
and the average number of snore events per unit time, as revealed by the 
intensity of the sound. Our method of analyzing snore correlated remarkably 
well with the AHI derived from polysomnography, which is the gold standard 
method for OSA diagnosis. Moreover, an automated classifier based on the 
R/S analysis of the snore intensities was able to correctly identify patients 
with moderate to severe OSA. 

Of course, our study has some limitations. Firstly, we tested this algo- 
rithm in a relatively small sample of patients referred to the sleep laboratory 
under controlled conditions. We have no means to predict at this stage how 
this method will perform in other settings and populations. However, it 
must be stressed that the physical conditions of the laboratory are probably 
similar to most bedrooms. We acknowledge that due to the simplicity of 
our snore detection scheme, some snore events may not be detected or some 
loud sound may be detected as a snore. However, these false or undetected 
snore events are expected to represent a small fraction of all events, and 
thus will be washed out in the statistics. We predict that more sophisti- 
cated methods to detect snores may improve the snore-time-interval index. 
Nevertheless, our work represents a first step towards establishing guidelines 
for snore classification and revealing a correlation between snoring events 
suspected of sleep-disordered breathing and the AHI, goals manifested in a 
recent medical review on the acoustics of snoring [27]. 
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